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ABSTRACT 



It is of fundamental importance to determine if and how hierarchical 
clustering is involved in large-scale structure formation of the universe. 
Hierarchical evolution is characterized by rules which specify how dark matter 
halos are formed by the merging of halos at smaller scales. We show that 
scale-scale correlations of the matter density field are direct and sensitive 
measures to quantify this merging tree. Such correlations are most conveniently 
determined from discrete wavelet transforms. Analyzing two samples of Lya 
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forests of QSO's absorption spectra, we find significant scale-scale correlations 

q ■ whose scale dependence is typical for branching processes. Therefore, models 

which predict a "history" independent evolution are ruled out and the halos 

OS ■ hosting the Lya clouds must have gone through a "history" dependent merging 

^ ■ process during their formation. 
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1. Introduction 

The recent discoveries of the excess of faint blue galactic counts (Lilly et al. 1995, 
Ellis et al. 1996) and a substantial population of star forming galaxies at redshift z ~ 
3-3.5 (Steidel et al. 1996) may be taken as evidence of hierarchical structure formation 
(Kauffmann & White 1993; Lacey & Cole 1993; Navarro, Frenk & White 1996). However, 
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the predicted abundance of galaxies at higher redshifts depends on, at the very least, models 
of stellar population synthesis and the IMF. Besides these abundances, a more direct and 
independent detection of hierarchical evolution is necessary to form a convincing argument 
for this scenario as well as to discriminate among different models of structure formation. 

Scenarios of hierarchical matter clustering are defined in terms of rules which determine 
how dark halos evolve from scale to scale. Measuring correlations of structures at different 
scales is a direct and sensitive way to test the hypothesis of hierarchical clustering. We 
propose to detect this hierarchical structure by studying correlations between coefficients of 
a wavelet decomposition of the density field p(x). We suggest relevant measures to quantify 
such scale-scale correlations and to discriminate between different scenarios. Scale-scale 
correlation measures have been shown to effectively reveal hierarchical characteristics of 
energy transfer in turbulence and multiparticle physics (Yamada & Ohkitani, 1991, Greiner, 
Lipa, & Carruthers, 1995.) 

The hierarchical clustering model of galaxy formation generally refers to the 
assumption that the irreducible correlation functions are described by the hierarchical 
relations £ n = QnCi' 1 -, where £ n is the n-th order correlation function, and Q n are 
constants (White 1979). Obviously, if the hierarchical relations hold exactly, the two-point 
correlation function plus all Q n completely characterize the statistical features of galaxy 
formation, including higher order correlations such as scale-scale correlations. However, 
the hierarchical relations are only approximately fulfilled. Studies in turbulence in which 
fractal hierarchical relations only approximately hold showed that scale-scale correlations 
were still useful in describing statistical features (Yamada & Ohkitani, 1991.) Therefore, 
scale-scale correlations will be useful in revealing deviations from the linked pair ansatz, 
and to discriminate among models including those satisfying the hierarchical relations 
approximately. 



2. Scale-scale correlations 

In order to assess the discriminative power of scale-scale correlations we discuss two 
exactly solvable models of hierarchical clustering. The first is a Gaussian version of the 
block model (Cole & Kaiser 1988) employing an additive merging rule, while the second 
is similar in spirit, but uses a multiplicative merging rule (adapted from Meneveau & 
Sreenivasan 1987). Both models give identical first and second order statistics and are 
constructed in such a way as to reproduce the experimental power spectrum. However, 
their different merging rules imply a quite different structure of correlations beyond second 
order. 
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The Gaussian block model. A block of mass M distributed uniformly over a length 
L is successively divided by a factor two, giving 2 J blocks at mass scale M /2 J at the j-th 
iteration, while the length scale is L/2- 7 . Each block is labeled by the pair (j, I), where 
j — 0, . . . , J denotes the scale (with some appropriate cut-off scale J) and / = 0, . . . , 2 J — 1 
gives the position of a block at scale j. The mean mass density of all blocks is p = Mq/L. 
The density contrast is defined by e(x) = (p(x) — p)/~p. 

A realization of the density contrast field is then generated as follows: For the 
largest block (0, 0), the density contrast is eo,o = + e, where e is drawn from a Gaussian 
distribution with variance E. For the two blocks (1,0) and (1,1) at the next finer scale 
j = 1, the two density contrasts are respectively enhanced and diminished by an additive 
factor e 0j o which is again drawn randomly from a Gaussian, but with a different variance 
E . Iterating down to scale J, we obtain the additive merging rule of the Gaussian block 
model: 



where each is drawn independently from a Gaussian with variance E.,- which may depend 
on the scale j but not on the position /. It is clear that in this scenario e^, describing the 
difference between density contrasts on scales j + 1 and j at overlapping positions, does not 
depend on the density e^j in the parent block and is in this sense "history" independent. 

The finest scale distribution ejj yields a realization of the density distribution 
p(x) = p[l + e(x)], where e(x) = ej^ when Ll/2 J < x < L{1 + 1)/2 J . This p{x) is then to 
be compared to the observed density distribution on spatial resolution ~ L/2 J . In order to 
reproduce the power spectrum P(k) one has to choose for the variances E 2 and E 2 



where kj = ir2 j+1 /L. 

Since all ejj are constructed from sums of independent Gaussian random variables, the 
resulting density matter field p(x) is a Gaussian random field which, by definition, produces 
no genuine correlations beyond second order. 

The branching block model. To apply block models to dark halos, we should 
identify holes with blocks satisfying the condition of collapse. This procedure introduces a 
history dependence in the sense that now the density differences between adjacent scales do 
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depend on the value of the parent block. Since we are not concerned with numerical details 
at the moment, we simply simulate this history dependence as follows: For block (0,0), the 
density contrast is assigned just as in the Gaussian block model. The mass M of block (0, 0) 
is split into blocks (1, 0) and (1, 1) with unequal masses of M(l + olq)/2 and M(l — olq)/2 
or, with equal probability, vice versa. Obviously, mass is conserved at each evolution step. 
The density contrasts are thus enhanced or diminished by multiplicative random weights 
1 ± a , e.g. e 1>0 = (1 + a )(l + e 0)0 ) - 1 and e ltl = (1 - a )(l + e ,o) - 1- Here only the 
sign of ctj is a random variable, assigning the positive sign with probability 1/2 to the left 
and right sub-blocks respectively; the value of ctj is a scale-dependent constant in the range 
0<aj< 1. 

The general recurrence relation between the density contrast on adjacent scales j and 
j + 1 is now 

(l+e i+ i, a ) = (l±a j )(l + e j , l ) 
(l + e i+1)2I+1 ) = (lT^Kl + e,,), (4) 

describing a hierarchical branching process. The multiplicative structure of this merging 
rule renders the resulting density field p(x) non-Gaussian and leads to significant genuine 
correlations beyond second order. The difference between density contrasts on scales j and 
j + 1, given by e^i = ±«j(l + e^j), is now obviously history dependent. 

Although the Gaussian- and branching-block models have very different evolution 
rules (HD and (d), their first and second order moments can be made equal if <x,- is chosen 
recursively by 

S 2 

= (^ + S 2 )(l + a§)(l + a?)...(l + a?_ 1 )- (5) 

We thus arrive at two paradigmatic models with quite different merging scenarios with 
identical means and covariances. No measure based on first and second order statistics 
can discriminate between these scenarios. Clearly, what is needed is a measure of the 
hierarchical evolution by scale-scale correlations involving moments of order higher than 
two. 

The essential information of cluster formation is captured in the properties of the 
differences at two adjacent evolution steps. This is exactly the information obtained by a 
discrete wavelet transform (DWT). More generally, the coefficients ejj of any decomposition 

J-l 2^-1 

= EE l U ^iA x ) (6) 

3=0 1=0 
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with respect to a complete and orthogonal wavelet basis ipjj(x) (Daubechies 1992) provide 
similar information on density differences at adjacent scales; for the purpose of analysis, 
the particular choice of wavelet is of secondary importance and all will lead to comparable 
results (Pando & Fang 1996). 

The counts-in-cell (CIC) method has been applied to study the scale-dependence of 
clustering and has even been applied to hierarchical studies (Balian & Schaeffer 1989, 
Bromley 1994). However, there are differences between the CIC and DWT analysis. The 
basis of the DWT decomposition [eq.(6)] ipj,i{x) is orthogonal with respect to both j (scale) 
and I (position), while the basis (or window) for the CIC is orthogonal with respect to I, 
but not to j. Moreover, the Haar wavelet scaling functions (which are equivalent to the CIC 
with cubic cells) are not localized in Fourier (scale) space. It is not possible to effectively 
measure scale-scale correlations by a scale-mixed decomposition. On the other hand, 
hierarchical clustering is characterized by local relations between large and small structures, 
and therefore, the decomposition should also be localized in physical space. The DWT is 
constructed by an orthogonal, complete and localized (in both physical and Fourier space) 
basis. For these reasons, studying scale-scale correlations via the DWT is an effective tool. 

The statistical properties of the wavelet coefficients ijj of a random field e(x) are most 
conveniently obtained once the generating function 

^ (J H^] = (exp(|/E 1 E 1 ^^) (7) 

is known, where 77 represents the auxiliary variables rjjj and (. . .) denotes the ensemble 
average. For both block model versions discussed above one can directly translate the 
merging rules into recursion relations for their generating functions Z^ +1 ^ and at two 
adjacent scales. The explicit formulae may be found in more detailed expositions (Greiner, 
Lipa, & Carruthers 1995; Greiner et al. 1996) 

Thus, various correlation quantities can be calculated from Z^[rj\ by taking appropriate 
derivatives. For instance, the correlations between a wavelet coefficient of a block / at scale 
j and it's left sub-block 21 at scale j + 1 are found by 

1 d p+q Z^ 



(8) 
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Specifically, we use symmetric and normalized correlation measures with even orders p = q, 
henceforth called scale-scale correlations: 



\ e j,l € j+l,2l) 

2^i=o \ e j,u 2^i'=o \ e j+i,i' 
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As expected, for the Gaussian block model we obtain Cj' p = 1 for all p > 2, i.e. there 
are no scale-scale correlations of order greater than two. Higher order correlations for the 
branching block model have been calculated in Greiner et al. (1996); the p = 2 scale-scale 
correlations for a simplified version with scale-independent a 3 - = a are 



, 2>2 _ (l + 6a 2 + a 4 y 
(1 + a 2 ) 2 i 



cr = yi i.z,.' ■ no) 



adjacent scales and discriminate clearly between a Gaussian and branching scenario. 



Thus, the Cj' p provide a sensitive quantification of correlations between structures living at 



It is interesting to point out that if the hierarchical relations (White 1979) hold, the 
constant Q n can approximately be described by the scale-scale correlations. For instance, 
for n=4, we have 



23 ^ v 3,H 23+ 1 
- 1=0 * 1=0 



This relation shows that it is possible to test the assumption that the Q n are scale (or 
j)-independent by scale-scale correlations (Pando, J. et al. 1997.) 



3. An example: Lya absorption forests 

It is generally believed that the Lya forests of QSO absorption spectra are due to 
the absorption of pre-collapsed clouds in the density field of the universe (Fang et al. 
1996). Hierarchical clustering requires that both collapsed halos and pre-collapsed clouds 
undergo similar merging evolutions. As such, the Lya forests should be good candidates for 
detecting hierarchical clustering. 

We looked at two data sets of Lya forests. The first was compiled by Lu, Wolfe and 
Turnshek (1991, hereafter LWT). The total sample contains ~ 950 lines from the spectra 
of 38 QSO that exhibit neither broad absorption lines nor metal line systems. The second 
set is from Bechtold (1994, hereafter JB), which contains a total of ~ 2800 lines from 78 
QSO spectra, in which 34 high redshift QSO's were observed at moderate resolution. To 
eliminate the proximity effect, all lines with z > z ern — 0.15 were deleted from our samples 
(Pando & Fang 1996). These samples cover a redshift range of 1.7 to 4.1, and a comoving 
distance range from about D min = 2,300 /i -1 Mpc to D max =3,300 /i^Mpc, if go = 1/2 an d 
h= #o/100 km s" 1 Mpc" 1 . 

We make block trees from the largest block L = D max — D m i n with L/2 J , and 
j — 0, . . . , 9. The smallest block-size L/2 9 ~ 2 h -1 Mpc is about the scale where the effect 
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of line blending occurs. Moreover, since we will only study scale-scale correlations on scales 
equal to or larger than about L/2 8 ~ 5 h _1 Mpc, the influence of peculiar motions should 
be negligible. 

To reduce the influence of the z-dependence in the mean density p of Lya lines, we 
chop the entire red-shift space into segments with size Az = 0.4. This corresponds to 
a comoving space of 270 h~ 1 Mpc for the lowest z of the LWT and JB samples, and 110 
h _1 Mpc for the highest z of the samples. 

To account for the remaining z-dependence of p, 100 random samples for each data 
set are generated by shifting each observed line by a random distance SD not exceeding 
the interval distance corresponding to Az. Any line shifted outside the interval is wrapped 
around to bring it back into the interval. This procedure gives a de-correlated (random) 
sample which still reflects the z-dependence of the observed sample. Both, the observed 
and random samples are suitable for statistical analysis on scales less than 100 h^ 1 Mpc. 

The real data and random sample are subjected to the four-coefficient Daubechies 
discrete wavelet (D4), which is better localized in Fourier space than the Haar wavelet. Fig. 

2 2 m % 

1 shows the results for Cy of the LWT and JB samples with line widths > 0.32 A. Clearly, 

2 2 

the scale-scale correlation Cy for the observational data is significantly larger than unity 
and well above the random samples on all scales j > 5 (i.e. less than about 80 h -1 Mpc). 
More importantly, the two independent data sets, LWT and JB, show similar behavior. 
Thus, the detected scale-scale correlations seem to be an intrinsic feature of the clustered 
density field traced by Lya forests. The influence of the ^-dependence of p is estimated by 
the values of C|' 2 for the random samples (shaded regions in Fig. 1); these are slightly above 
unity, but can certainly not explain the strong j-dependence of the LWT and JB samples. 

The Lya absorption line distribution is a discrete process. The discreteness is a source 
of non-Gaussianity as it is very well known that Poisson noise is non-Gaussian. The question 
naturally arises as to whether the non-Gaussianity measured by CJ' 9 is caused by a Poisson 
process. This non-Gaussianity has been carefully studied in Greiner, Lipa and Carruthers 
(1995) and Fang and Pando (1997). The main conclusion is that the non-Gaussianity of 
Poisson noise is significant only on scales of the mean distance of nearest neighbors. The 
distributions on large scales are a superposition of the small scale field. According to the 
central limit theorem the non-Gaussianity of Poisson noise will rapidly and monotonously 
approach zero on larger scales. In our analysis the scales being studied are larger than 5 
h _1 Mpc which is much larger than the mean distance between nearest neighbor Ly-alpha 
lines. Cf q ^ 1 is not due to the discreteness of samples or noise, especially for larger scales. 

Fig. 1 also demonstrates that the branching block model reproduces the trend of the 
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observed data, while the Gaussian block model (C 2,2 = 1) certainly lacks a mechanism that 
produces the observed hierarchical correlation structure even when the ^-dependence of the 
mean number density of Lya clouds is taken into account. 




Fig. 1. — Scale-scale correlations C 2 ' 2 for Lya samples LWT (top) and JB (bottom) with 
line width larger than 0.32A. The curves show the values for the Gaussian block model 
(dash-dotted) and the branching block model (full line) with a = 0.34. The shaded areas 
are the ±a of 100 random samples for the LWT and JB cases respectively. 



4. Conclusion 

Scale-scale correlations Cf p and possible variants are viable statistical measures to 
discriminate between different scenarios of merging dynamics of large scale structures of 
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the universe. They provide a direct clue of how larger halos and clouds are built up from 
their substructures at smaller scales. The wavelet transform is a fast and convenient tool to 
obtain the necessary information on localized contributions to the matter density field at 
different scales. 

Just as the two-point correlation function 1 + > 1 is an indicator of spatial 
clustering, scale-scale correlations CJ' P > 1 indicate some sort of "history" dependence in 
hierarchical clustering schemes. The scale-scale correlations of the one-dimensional Lyct 
forests show, indeed, features expected by multiplicative hierarchical clustering. Similar 
features have been found in the case of hydrodynamical turbulence. One can conclude that 
the halos hosting Lya clouds must have undergone a "history" dependent evolution process 
in some way during their formation. 

Finally, it is not difficult to generalize this method to three dimensions. 
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relation between Q n and Cj' p . 
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